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Abstract 



The measuring process is an external intervention in the dynamics of a quantum system. It involves 
a unitary interaction of that system with a measuring apparatus, a further interaction of both with an 
unknown environment causing decoherence, and then the deletion of a subsystem. This description of 
the measuring process is a substantial generalization of current models in quantum measurement theory. 
In particular, no ancilla is needed. The final result is represented by a completely positive map of the 

C^ ' quantum state p (possibly with a change of the dimensions oi p). A continuous limit of the above process 

^ , leads to Lindblad's equation for the quantum dynamical semigroup. 
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I. INTRODUCTION 

The measuring process [^ ||] is the interface of the classical and quantum worlds. The classical world 
has a description which may be probabilistic, but in a way that is compatible with Boolean logic. In the 
quantum world, probabilities result from complex amplitudes that interfere in a non-classical way. In this 
article, the notion of measurement is extended to a more general one: an intervention. An intervention 
has two consequences. One is the acquisition of information by means of an apparatus that produces a 
record. This step is called a measurement. Its outcome, which is in general unpredictable, is the output 
of the intervention. The other consequence is a change of the environment in which the quantum system 
will evolve after completion of the intervention. For example the intervening apparatus may generate a 
new Hamiltonian that depends on the recorded result. In particular, classical signals may be emitted for 
controlling the execution of further interventions. In the second part of this article 0, these signals will 
be limited to the velocity of light, so as to obtain a relativistic version of quantum measurement theory. 

Interventions are mathematically represented by completely positive maps. Their properties are dis- 
cussed in Sect. II, where a detailed dynamical description is given of the measuring process: it involves 
unitary interactions with a measuring apparatus and with an unknown environment that causes decoher- 
ence, and then the optional deletion of a subsystem. The Hilbert space for the resulting quantum system 
may have a different number of dimensions than the initial one. Thus, a quantum system whose descrip- 
tion starts in a given Hilbert space may evolve into a set of Hilbert spaces with different dimensions. If 
one insists on keeping the same Hilbert space for the description of the entire experiment, with all its 
possible outcomes, this can still be achieved by defining it as a Fock space. 

The term "detector" will frequently appear in this article and in the following one. It means an 
elementary detecting element, such as a bubble in a bubble chamber, or a small segment of wire in a 
wire chamber. Note that in such a detector, the time required for the irreversible act of amplification 
(the formation of a microscopic bubble, or the initial stage of the electric discharge) is extremely brief, 
typically of the order of an atomic radius divided by the velocity of light. Once irreversibility has set 
in, the rest of the amplification process is essentially classical. It is noteworthy that the time and space 
needed for initiating the irreversible processes are incomparably smaller than the macroscopic resolution 
of the detecting equipment. 

An intervention is described by a set of parameters that include the time at which the intervention 
occurs. Interventions of finite duration can also be considered [0 and will be briefly discussed. For a 
relativistic treatment (in the following article), we shall need the location of the intervention in spacetime, 
referred to an arbitrary coordinate system. In any case, we have to specify the speed and orientation of 
the apparatus in the coordinate system that we are using and various other input parameters that control 
the apparatus, such as the strength of a magnetic field, or that of an RF pulse used in the experiment, and 
so on. The input parameters are determined by classical information received from past interventions, 
or they may be chosen arbitrarily by the observer who prepares that intervention, or by a local randoin 
device acting in lieu of the observer. 

A crucial physical assumption is that there exists an objective time ordering of the various interventions 
in an experiment. There are no closed causal loops. This time ordering defines the notions earlier and 
later. The input parameters of an intervention are deterministic (or possibly stochastic) functions of the 
parameters of earlier interventions, but not of the stochastic outcomes resulting from later interventions. 
In such a presentation, there is no "delayed choice paradox" [Q (there can be a delayed choice, of course, 
but no paradox is associated with it). 

The word "measurement" is a bit misleading, because it suggests that there exists in the real world 
some unknown property that we are measuring [|6| . This term was banned by Bell [[7| , though for a different 
reason: Bell pointed out that the notion of measurement, or observation, was logically inconsistent in a 
world whose description is purely quantum mechanical. However, the approach followed in the present 
article does not comply with Bell's desiderata. It explicitly associates classical inputs and outputs with 
each intervention [g|, ||. 

The probabilities of the various outcomes of an intervention can be predicted by using a suitable 
theory, such as quantum theory. Besides these outcomes, there may also be other output parameters: 
there may be modifications of the physical environment depending on which outcome arose, and the 
intervening apparatus may emit classical signals with instructions for setting up later interventions. As a 



concrete example, consider the quantum teleportation scenario [pij[ . The first intervention is performed 
by Alice: she has two spin-^ particles and she performs on them a test with four possible outcomes. When 
Alice gets the answer, she emits a corresponding signal, which becomes an input for Bob's intervention: 
the latter is one of four unitary transformations that can be performed on Bob's particle. 

Quantum mechanics is fundamentally statistical uM- In the laboratory, any experiment has to be re- 
peated many times in order to infer a law; in a theoretical discussion, we may imagine an infinite number 
of replicas of our gedankenexperiment, so as to have a genuine statistical ensemble. The various experi- 
ments that we consider all start in the same way, with the same initial state po, and the first intervention 
is the same. However, later stages of the experiment may involve different types of interventions, possibly 
with different spacetime locations, depending on the outcomes of the preceding events. Yet, assuming 
that each intervention has only a finite number of outcomes, there is for the entire experiment only a 
finite number of possible records. (Here, the word "record" means the complete list of outcomes that 
occurred during the experiment. I do not want to use the word "history" which has acquired a different 
meaning in the writings of some quantum theorists.) 

Each one of these records has a definite probability in the statistical ensemble. In the laboratory, 
experimenters can observe its relative frequency among all the records that were obtained; when the 
number of records tends to infinity, this relative frequency is expected to tend to the true probability. 
The role of theoretical physics is to predict the probability of each record, given the inputs of the 
various interventions (both the inputs that are actually controlled by the local experimenter and those 
determined by the outputs of earlier interventions). Note that each record is objective: everyone agrees 
on what happened (e.g., which detectors clicked). Therefore, everyone agrees on what the various relative 
frequencies are, and the theoretical probabilities are also the same for everyone. 

The "detector clicks" are the only real thing we have to consider. Their observed relative frequencies 
are objective data. On the other hand, wave functions and operators are nothing more than abstract 
symbols. They are convenient mathematical concepts, useful for computing quantum probabilities, but 
they have no real existence in Nature |12| . Note also that while interventions are localized in spacetime, 
quantum systems are pervasive. In each experiment, irrespective of its history, there is only one quantum 
system. The latter typically consists of several particles or other subsystems, some of which may be 
created or annihilated at the various interventions. 

The next section describes the quantum dynamics of the measuring process which is an essential 
part of each intervention. The role of decohcrcncc, due to an inavoidable interaction with an unknown 
environment, is discussed in Sect. HI. The final result, Eq. (pO|), will be extensively used in the following 
article. The right hand side of that equation contains operators ^^m which are typically represented by 
rectangular matrices. Some of their mathematical properties (in particular factorability) are discussed in 
Sect. IV. 

Decoherence, whose role is essential in the measuring process, is a stochastic phenomenon similar to 
Brownian motion. However, when seen on a coarse time scale, it is possible to consider it as a continuous 



process. This continuous approximation leads to the Lindblad equation |13 , which is derived in a simple 
and novel way in Sect. V. 

II. THE MEASURING PROCESS 

The measuring process involves several participants: the physical system under study, a measuring 
apparatus whose states belong to macroscopically distinguishable subspaces, and the "environment" 
which includes unspecified degrees of freedom of the apparatus and the rest of the world. These unknown 
degrees of freedom interact with the relevant ones, but they are not under the control of the experimenter 
and cannot be explicitly described. Our partial ignorance is not a sign of weakness. It is fundamental. If 
everything were known, acquisition of information would be a meaningless concept. 

In order to keep the discussion as general as possible, I do not introduce here any "ancilla," contrary 
to current fashion. This omission is not an oversight. It is intentional and deserves a brief explanation. In 
the early years of quantum mechanics, von Neumann wrote a rigorous mathematical treatise El which had 
a lasting infiuence. According to von Neumann, the various outcomes of a measurement correspond to a 
complete set of orthogonal projection operators in the Hilbert space of the quantum system under study. 
It was later realized that von Neumann's approach was too restricted, because the measuring process 



may have more distinct outcomes than the number of dimensions of that Hilbert space. The appropriate 
formahsm is that of a positive operator valued measure (POVM) llj, 03]. Namely, the various outcomes 
of the measurement correspond to positive operators -E^, which sum up to the unit operator but need 
not commute. 

This raised a new problem: the actual implementation of a given POVM. In the final section of his 
book, von Neumann had formally shown how to construct a Hamiltonian that generated a dynamical evo- 
lution of the type required to obtain a projection valued measure (PVM). This was a mathematical proof 
of existence, namely, quantum dynamics was compatible with the structure of a PVM. Is it compatible 
with a more general POVM? This question was answered by Helstrom jlfl] who converted the problem 
of implementation of a POVM into that of an ordinary von Neumann measurement, by introducing an 
auxiliary quantum system that he called ancilla (the Latin word for housemaid) . By virtue of Neumark's 
theorem |1^, any POVM can be obtained from a PVM applied to a composite system that consists of 
the original system and an ancilla having a sufhcient number of dimensions. This provides a formal proof 
of existence, but in real life this is usually not how measurements are actually performed. Anyway, even 
if an ancilla is used according to Helstrom's protocol, we may as well consider it as part of the measuring 
apparatus. Therefore the following description of the measuring process will not involve any ancilla, and 
yet it will explicitly show how any POVM can be implemented by a unitary interaction of the quantum 
system with a suitable apparatus. 

To simplify the notations, it will be assumed that finite dimensional Hilbert spaces are sufficient for 
describing the quantum system under study, the apparatus, and even the environment. Moreover, the 
initial states pi of the system and the apparatus are assumed pure. Initially mixed states would be a 
more realistic assumption, but since they can always be written as convex combinations of pure pi, their 
use would not bring any essential change in the discussion below. 

Let a set of basis vectors for the system under study be denoted as {|s)}. The initial state of that 
system is a linear combination, J-^q) = ^ Cs|s), with complex coefficients Cg- Let |^) be the initial state of 
the apparatus. In the first step of the measuring process, which may be called a "premeasurement" |18|] , 
the apparatus interacts unitarily with the quantum system and becomes entangled with it, so that they 
effectively become a single composite system C: 

5^c,|5)®|A)^^c,C/,,|A), (1) 

s s,\ 

where {|A)} is a complete basis for the states of C. It is the choice of Us\ that determines which property 
of the system under study is correlated to the apparatus, and therefore is measured. Unitarity implies 
that 

Y^UsxU:^^5st, (2) 

A 

or [/t/^ — 1, where 1 is the unit matrix in the Hilbert space of the original quantum system. The U 
matrix is not square: it has fewer rows than columns, because we have considered only a single initial 
state of the apparatus, namely \A). If we had introduced a complete set of states for the apparatus, 
labelled \B), |C), etc., then U would have been a unitary matrix satisfying 
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UsA,\ U^g ^ = 6 St Sab- (3) 



Our freedom of choosing the required unitary matrix UsA,x is equivalent to the freedom of choice of an 
interaction Hamiltonian in the von Neumann formalism. 

The apparatus itself is an utterly complicated system, and some radical assumptions are needed in 
order to proceed with explicit calculations. The assumptions below are not as drastic as those commonly 
used in quantum measurement theory, yet they ought to be clearly spelled out. Let us assume that 
the composite system C can be fully described by the theory. Its complete description involves both 
"macroscopic" variables and "microscopic" ones. The difference between them is that the microscopic 
degrees of freedom can be considered as adequately isolated from the environment for the duration of the 
experiment, so that their evolution is in principle perfectly controlled, while the macroscopic ones cannot 



be isolated from the unknown environment and the dynamical evolution cannot be completely predicted. 
Statistical hypotheses are required in order to make plausible predictions, as explained below. Any other 
degrees of freedom of the apparatus, for which no explicit description is provided, are considered as part 
of the environment. 

An essential property of the composite system C, which is necessary to produce a meaningful mea- 
surement, is that its states form a finite number of orthogonal subspaces which are distinguishable by 
the observer. These subspaces are similar to, but more general than, Zurek's "pointer basis" p9| | which 
is a preferred basis for the apparatus. Here, we consider orthogonal subspaces of the composite system 
C, which may have different numbers of dimensions. For example, a particle detector may have just two 
such subspaces: "ready to fire" and "discharged" (obviously, the latter has many more states than the 
former). Each macroscopically distinguishable subspace corresponds to one of the outcomes of the inter- 
vention and defines a POVM element E'^, given explicitly by Eq. (m) below. The labels fi are completely 
arbitrary, for example they may be the labels printed on the various detectors. The initial state of C, 
namely j-^o) ® |^): lies in the subspace that corresponds to the null outcome (no detection). 

Obviously, the number of different outcomes fj, is far smaller than the dimensionality of the composite 
system C. Let us introduce a complete basis, {\fi, S,)}, where fi labels a macroscopic subspace, as explained 
above, and ^ labels microscopic states in that subspace. We thus have 

{f^,£.W,v) ^Sf,i,6^,^. (4) 

Note that the various subspaces labelled /i may have different dimensions, that is, the range of indices ^ 
may depend on the corresponding fi. We shall henceforth write f/s/i{ instead of Us\. 
After the premeasurement, given by Eq. (|l]), the state of C is given by 

|^i) = ^c,t/,^^|A*,0. (5) 

The probability of obtaining outcome /x is the contribution of subspace jj, to the density matrix 1-01) (fAil- 
Explicitly, it is 

Y^ CsC*t Usi^i Ut^^ = Y^ pst {E^)ts , (6) 

where 

is a POVM element, defined in the Hilbert space of the system under study whose initial state was pst- 
Note that the matrices E'^ satisfy 

5](E^),.=^ [7,^4(7,;^ =(5,,, (8) 

by virtue of the unitarity property in Eq. (0). 

III. DECOHERENCE 

Up to now, the quantum evolution is well defined and it is in principle reversible. It would remain so if 
the macroscopic degrees of freedom of the apparatus could be perfectly isolated from their environment, 
and in particular from the "irrelevant" degrees of freedom of the apparatus itself. This demand is of 
course self-contradictory, since we have to read the result of the measurement if we wish to make any use 
of it. 

Let {|eQ,)} denote a complete basis for the states of the environment, and let le^j) be the state of the 
environment at the moment of the premeasurement. That state is of course unknown, but I temporarily 



assume that it is pure, and moreover that it is one of the basis states, in order to simphfy the notations. 
This pure initial state will later be replaced by a density matrix, 






Pu:\eui){eu:\, (9) 



with unknown random non- negative coefficients p^ . There is no loss of generality in assuming that pe is 
diagonal in the basis used for the states of the environment. This merely says that this basis was chosen 
in the appropriate way. 

Recall that states |/i,C) with different /i are macroscopically different, so that they interact with 
different environments. On the other hand, the labels ^ refer to microscopic degrees of freedom that are 
well protected from parasitic disturbances. This sharp dichotomous distinction between the two types of 
degrees of freedom is the only approximation that was made until now. 

The unitary interaction of C with the environment thus generates an evolution which is not under the 
control of the experimenter: 

\^J^,Cl®K)-^\^^,C)®^b^,^c.\ec.)■ (10) 

The coefficients h^tja are unknown except for normalization, 



Ei^, 
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/ioja I 



= 1. (11) 



They have no subscript i^ because the microscopic degrees of freedom do not interact with the environment, 
and they cannot mix different values of /i because the latter refer to macroscopic outcomes that are stable 
on the time scale of the experiment. One could also consider a more general evolution, where the right 
hand side of Eq. (nOf) would involve different subspaces p. This would mean that the measuring apparatus 
is actually disturbed by the environment. Such a process is called noise and is essentially different from 
the phenomenon of decoherence , whose occurence is explained below. Here it is assumed that no noise 
affects the measuring process. It's only the environment, whose microscopic degrees of freedom are not 
robust, that is disturbed by the apparatus (this is the mechanism causing decoherence) . Exactly how 
it is disturbed cannot be known, however we do know that macroscopically different states of C lead to 
different disturbances of the environment, hence the appearance of an index fi in the coefficients fo^ojQ. 
The final state with all the participating subsystems thus is 

IV'2) = ^ CsUm,,^bfj,^a\fJ',0'^\(^a)- (12) 

The final density matrix (still a pure state, for the sake of simple notations) is p = |'02)('02|- Explicitly, we 
see from Eq. (12) that the expression for p contains, among other things, operators |eQ)(e^| which refer 
to states of the environment. They are unknown and are considered as unknowable. The only operator 
acting on these states that we know to write is represented by the unit matrix SafS- Its meaning, in 
the laboratory, is that of complete ignorance. Therefore we can effectively replace the complete density 
matrix p by the reduced matrix obtained from it by ignoring the inaccessible degrees of freedom of the 
environment. That is, we replace the operators |eQ,)(e/3| which appear in p by (e^jeo,) = Sa/s, and we 
perform a partial trace on the indices that refer to the environment. 

The reduced density matrix thus contains expressions of the type ^^ b^^a bl,^^- It will now be shown 
that, after a reasonably short time has elapsed, we have 

/ , b^i^a O^^Q ~ O^iu- (lo) 

a 

The case p — v \s the normalization condition ( [ll|) due to unitarity. When p ^ v^ the rationale for arguing 
that the left hand side of (13) is very close to zero (and has a random phase) is that the environment has a 
huge number of states, N say, whose dynamics is chaotic. Therefore the scalar product of any two states 
such as ^ b^^^a\&a) and ^ &i/w/3|e^) that may result from Eq. ([l(]|), at any random time, is of the order of 



7V~^/^, because the components of a random state, in a randomly chosen basis, are of the order of N^^^^. 
The time that has to elapse to make Eq. (13) a good approximation is called the decoherence time, and 
it depends on how well the macroscopic degrees of freedom of the measuring apparatus are isolated from 
the environment. There may of course be large fluctuations on the left hand side of Eq. (13), akin to 
Poincare recurrences pQ], but this expression is almost always very close to zero \i ^ ^ v. 

The approximation becomes even better if instead of an ideal pure initial state \ej) for the environment, 
we take the more realistic density matrix given by Eq. (0). Instead of Eq. (13) we then have 

^Pu>^h^,^ahl^^~5^,^, (14) 

uj a. 

where the off-diagonal terms on the right hand side now are of order N~'^, rather than N~'^/'^ as before. 
It is plausible that the above argument can be made mathematically rigorous in the thermodynamic limit 
N -^ (X). 

It follows that states of the environment that are correlated to subspaces of C with different labels 
/i can safely be treated in our calculations as if they were exactly orthogonal. The resulting theoretical 
predictions will almost always be correct, and if any rare small deviation from them is ever observed, it 
will be considered as a statistical quirk, or an experimental error. The reduced density matrix thus is 
block-diagonal and all our statistical predictions are identical to those obtained for an ordinary mixture 
of (unnormalized) pure states 

where the statistical weight of each state is the square of its norm. This mixture replaces the pure state 
IV'i) in Eq. (0). This is the meaning of the term decoherence. From this moment on, the macroscopic 
degrees of freedom of C have entered into the classical domain ^, |2|] . We can safely observe them and 
"lay on them our grubby hands" p3]. In particular, they can be used to trigger amplification mechanisms 
(the so-called detector clicks) for the convenience of the experimenter. 

Note that all these properties still hold if the measurement outcome happens to be the one labelled 
/i = (that is, if there is no detector click). It does not matter whether this is due to an imperfection 
of the detector or to a probability less than 1 that a perfect detector would be excited. The state of the 
quantum system does not remain unchanged. It has to change to respect unitarity. The mere presence of 
a detector that could have been excited implies that there has been an interaction between that detector 
and the quantum system. Even if the detector has a finite probability of remaining in its initial state, 
the quantum system correlated to the latter acquires a different state [Q . The absence of a click, when 
there could have been one, is also an event and is part of the historical record. 

The final (optional) step of the intervention is to discard part of the composite system C. In the case 
of a von Neumann measurement, the subsystem that is discarded and thereafter ignored is the measuring 
apparatus itself. In general, it is a different subsystem: the discarded part may depend on the outcome ^ 
and in particular its dimensions may depend on /i. The remaining quantum system then also has different 
dimensions. We therefore introduce in the subspace ^ two sets of basis vectors \^i,cr) and l/x,™) for the 
new system and the part that is discarded, respectively. They replace the original basis |/x,^) and it is 
convenient to choose the latter in such a way that for each S, we can write 

Im,C) = lM,cr)(K)|M,nT-) (16) 

as a direct product, rather than a linear superposition of such products. 

We are now ready to discard the subsystem whose basis vectors are denoted as |^,to). In the unnor- 
malized density matrix p^ — \'4' ^i) {ip ^i\ (whose trace is the probability of observing outcome /x), we ignore 
the deleted subsystem. Namely, we replace the operator |/x,TO)(/x,n| that appears in p^ by a unit matrix 
5rnn and we perform a partial trace on the indices m and n, as we have done when we discarded the 
states of the environment. We thus obtain a reduced density matrix 

Pj. = XI '^^'^t X! ^'*^'^™ ^t,,T-m Im, cr) (a*, t-\ . (17) 



Its elements (/x, cr|p' |/i, r) can be written as 

(/^^)-r = ^ 5Z(^'^™)-« P-t (A;,„)rt, (18) 

m s,t 

where pst = CgC^, and the notation 

was introduced for later convenience. Recall that the indices s and a refer to the original system under 
study and to the final one, respectively. Omitting these indices, Eq. (nS) takes the familiar form 



P^^ 



E^A.mP4m. (20) 



which is the most general completely positive linear map p5[ . This is sometimes written as p' = Sp, where 
S" is a linear superoperator which acts on density matrices (while an ordinary operator acts on quantum 
states). Note however that these superoperators have a very special structure, given by Eq. (20). 

Clearly, the "quantum jump" p ^ p' is not a dynamical process that occurs in the quantum system 
by itself. It results from the introduction of an apparatus, followed by its deletion or that of another 
subsystem. In the quantum folklore, an important role is played by the "irreversible act of amplification." 
The latter is irrelevant to the present issue. The amplification is solely needed to help the experimenter. A 
jump in the quantum state occurs even when there is no detector click or other macroscopic amplification, 
because we impose abrupt changes in our way of delimiting the object we consider as the quantum system 
under study. The precise location of the intervention, which is important in a relativistic discussion [g| , is 
the point from which classical information is sent that may affect the input of other interventions. More 
precisely, it is the earliest spacetime point from which classical information could have been sent. This 
is also true for interventions that gave no detection event. Such a passive intervention is located where 
the detection event would have occurred, if there had been one. 

Is it possible to maintain a strict quantum formalism and treat the intervening apparatus as a quan- 
tum mechanical system, without ever converting it to a classical description? We could then imagine 
not only sets of apparatuses spread throughout spacetime, but also truly delocalized apparatuses Eq], 
akin to Schrodinger cats p7| , E8[ , so that interventions would not be localized in spacetime as required in 
the present formalism. However, such a process would only be the creation of a correlation between two 
nonlocal quantum systems. This would not be a true measurement but rather a "premeasurement" pq| . 
A valid measuring apparatus must admit a classical description equivalent to its quantum description | |22| 
and in particular it must have a positive Wigner function. Therefore a delocalized apparatus is a con- 
tradiction in terms. If a nonlocal system is used for the measurement, it must be described by quantum 
mechanics (no classical description is possible) , and then it has to be measured by a valid apparatus that 
behaves quasi-classically and in particular is localized. It can indeed be localized as well as we wish, if it 
is massive enough. 

Likewise, quantum measurements of finite duration, as discussed by Peres and Wootters B, actually 
are only premeasurements. To obtain consistent results, these authors had to explicitly introduce a second 
apparatus that suddenly measures the first one. Their first apparatus has no classical description. In the 
language that I am using now, only the second apparatus performs a valid measurement. 

In a purely quantum description of the apparatus, which is the one appropriate at the premeasurement 
stage, the new state is an incoherent mixture of various p^ correlated to distinct outcomes p of the 
apparatus. However, the description of the apparatus must ultimately be converted into a classical 
one ||2l|, [22) if we want it to yield a definite record. On the other hand, it is also possible to discard the 
apparatus without recording its result. We then have to describe the state of the quantum system by a 
mixture of mixtures, as in Eq. ( |37| ) below. The term "compound" P9| has been proposed for that kind 
of mixture which is solely due to our ignorance of the actual outcome and has no objective nature. Once 
we have a definite outcome p, the new state is p^, given by Eq. (pOj). 



IV. KRAUS MATRICES 

A special case of Eq. (20) for square matrices A^m was obtained by Kraus [Q who sought to find 
the most general completely positive map for the density matrices of a given quantum system (no change 
of dimensions was allowed). Kraus's result obviously is a generalization of von Neumann's prescription 
for the state resulting from the /x-th outcome of a measurement, namely p^ — P^pPf^, where P^ is the 
projection operator associated with outcome /i. Recall that, even if the initial p is normalized to unit 
trace (as we always assume), the trace of p^ in the above equations is not equal to 1. Rather, it is the 
probability of occurrence of outcome p.. It is quite convenient to keep p^ unnormalized, with the above 
interpretation for its trace. 

The results obtained here are more general than those of Kraus, because the matrices A^m may be 
rectangular. As Eq. (|l9|) shows, these matrices are simply related to the unitary transformation Usfiam 
that generates the premeasurement [ pl| . Superoperators that do not conserve the number of dimensions 
of the density matrix were also considered by other authors [p2|. The present treatment is even more 
general, because it allows the number of rows in A^^ (that is, the order of p'^) to depend on p, since we 
may decide to discard different subsystems according to the outcome of the measurement. 

From Eq. (|l9|), which relates the matrix elements {A^m)us to the unitary transformation involved in 
the quantum intervention, it appears that if we multiply the order of p' by the range of the indices m in 
Afj,m, the product of these two numbers is the same for all p, since it is equal to the number of dimensions 
of the composite system C, namely the original quantum system together with the measuring apparatus. 
However, the situation is more complicated, because it may happen that Af^m ~ for some values of 
pm. Moreover, if the matrices A^m and A^n are proportional to each other for some m and n, these 
matrices can be combined into a single one; and conversely, any A^m can be split into several ones which 
are proportional to each other. Therefore there is no simple rule saying how many terms appear in the 
the sum in Eq. (po|). 

The probability of occurrence of outcome p in a measurement is given by Eq. (o) and it can now be 
written as 



Pm 



-^Tr(^^,„p4„) = Tr(i?^p). (21) 



The positive (that is, non- negative) operators 

^fJ- = 2^ ^tj.m ^f^m, (22) 

m 

whose dimensions are the same as those of the initial p, are elements of a POVM and satisfy ^ E^^ = 1. 
Note that null outcomes (i.e., no detection) have to be included in that sum. They indeed are the most 
probable result in typical experimental setups. Yet, even if no detector is excited, the intervention may 
affect the quantum system p4| , and the corresponding A^m are not trivial. There may even be several 
distinct ^^,„ for "no detection," depending on the cause of the failure. 

Conversely, given E^ (a non-negative square matrix of order k) it is always possible to split it in 
infinitely many ways as in Eq. (p2). This is easily proved by taking a basis in which Ef^ is diagonal. All 
the elements are non-negative, so that by taking their square roots we obtain a matrix \fE^ that satisfies 
the relation required for A^m. Next, let {S'^m} be a set of complex rectangular matrices with k columns 
and any number of rows, satisfying ^Ylim '^um'^A'™ = 1- It follows that A^m = S^m\/E^ satisfies Eq. (|2^). 

Moreover, if a POVM is factorable, namely 

E^, ^ El'') (g> El^\ (23) 

where the indices (1) and (2) refer to two distinct subsystems, then the above construction provides 
factorable Kraus matrices, 

Ai_i,^rnn = A (g) A^„ . (24) 



The operator sum in Eqs. (|20|) and (^2|) now becomes double sums, over the indices m and n. Such 
double sums are indeed needed. If we had simply written, instead of Eq. (gSj), E'^ = i?/i (Xi i?^ , the 



corresponding Kraus matrices would in general not be factorable. Such a POVM, with a single index, is 
called separable and it cannot in general be implemented by separate operations on the two subsystems 
with classical communication between them p^. 

The factorization of a POVM as in Eq. (|23[) is not the most general one. It corresponds to two 
POVMs independently chosen by the two observers. However, the observers may also follow an adaptive 
stategy. After the first one (conventionally called Alice) executes the POVM {E"^ '}, she informs the 
second observer (Bob) of the result, /i say, and then Bob uses a POVM adapted to that result. It will be 
denoted as {Elf!}, with 



E^^ 



2) = l(2) 



V^. (25) 



Note that the chronological order of the Greek indices indicating the outcomes of consecutive measure- 
ments is from right to left, just as the order for consecutive applications of a product of linear operators. 



We then have, instead of Eq. (23), 

i?.M-4'^®^^M' (26) 

whence 

A.^„™ = A«®41- (27) 

A more complicated situation arises when the same system or subsytem is subjected to consecutive 
interventions that depend on the outcomes of preceding interventions. We have, subsequent to the map 
in Eq. (pa. 



p'tj, -^ pit, - zZ ■^''^" P't' ^It^ri- (28) 

n 

The result of these two consecutive maps can be written 

■n,,7n 

where 

It follows that 

n,m m n 

There is no simple relationship between this expression and the antecedent POVM element -E^, unless 
the Kraus matrices B^^^n are chosen in such a way that 

z2 ■^'''M" -^"^M" = 1- (32) 

We then have 



j:^^. 



S^. (33) 



This splitting of E^ into a sum of several parts is called a POVM refinement ||3^. It may be repeated 
many times, until all the final POVM elements are matrices of rank 1. For example, this can be done 
by an apparatus that includes a multidimensional ancilla which is not discarded at intermediate stages 
but only after the completion of the measuring procedure. This is why it is convenient to consider that 
ancilla explicitly rather than as part of the measuring apparatus. In the resulting Neumark extension 
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(that is, the joint Hilbert space of the quantum system and the ancilla) each POVM is implemented 
as an ordinary von Neumann PVM |lj, ^. Initially, the latter is coarse grained: it distinguishes only 
multidimensional subspaces of the system and the ancilla. This PVM is then gradually "refined" by the 
observer who uses further PVMs to select smaller and smaller orthogonal subspaces. How these subspaces 
are explicitly defined depends on results obtained in preceding tests. This is an adaptive strategy which 
is particularly efficient for the optimal identification of an unknown bipartite quantum state p3| . The 
two observers take turns in performing local measurements and informing each other of the results they 
obtained. The final result has again the form (Eq), but now the label /i stands for the entire sequence of 
intermediate outcomes that were obtained by the two observers, and the label v indicates the result of 
the very last intervention. 

For example, if each observer performs just two tests, with consecutive results /i (Alice), u (Bob), a 
(Alice), and r (Bob), then Eq. (E^) becomes 

This relationship is valid for any two pairs of consecutive tests, not only for those of the "refinement" 
type. 

Returning to the case of a single observer, let a complete set of POVM elements be given. It is then 
always possible to construct from their Kraus matrices (in infinitely many ways) unitary transformations 
that satisfy Eq. (||). Indeed, writing all indices explicitly, the relation ^ E^ — 1 becomes 

X! i^t^rnTas {A^m)at = ^st- (35) 

This can be written, owing to Eq. (p^), as 

/ , C^s^trm Utf^am = <5st , (36) 

which is the same as Eq. (y) with the index A (which refers to the composite system C) replaced by 
the composite index sfia. This explicitly shows how any POVM can be implemented by a unitary 
transformation and a suitable apparatus having the necessary dimensionality (usually much smaller than 
the one required by the introduction of an ancilla). 

Recall that the Ap,„ matrices are in general rectangular. For example in the teleportation scenario ||l^ 
Alice can discard her two quantum particles together with her measuring apparatus after she performs 
her 4-way test. The Hilbert space of Alice's subsystem then becomes trivial: it has only one dimension 
(only one state). In that case, A^m has a single row, Ajj^^ a single column, and p^ is just a number, 
namely the probability of obtaining outcome fi. Likewise, each stage of a quantum distillation |34[ pql 
causes a reduction of the number of dimensions of the quantum system that is distilled. That system 
initially consists of a set of entangled subsystems. Successive interventions select suitable subsets that 
have higher degrees of entanglement. Ideally, the final result should be arbitrarily close to a pair of spin- 2 
particles in a singlet state. 

Some authors consider only square matrices A^m and then it is mathematically permitted to sum 
all the pp so as to obtain the average state of all the outgoing quantum systems. For example, if the 
outcomes of our interventions are not recorded, so that no subensembles are selected, we may write 

where the trace of p' is 1, of course. Such sums are rarely needed in theoretical discussions. Different 
labels /i correspond to different world histories (that is, different samples in the ensemble of experiments) . 
Summing over them is like saying that peas and peanuts contain on the average 42% of water, instead of 
saying that peas have 78% and peanuts 6% |gg. Still, there are some cases where this kind of averaging 
is justified. For example, when we compare the expected yields of various distillation methods Q, |35|] , 
we are interested only in average results. Moreover, when the quantum system weakly interacts with an 
unknown environment (such as a heat bath), rather than with an apparatus that can neatly distinguish 
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different outcomes of the intervention, the result is a continuous decoherence of the quantum state |37|| . 
This issue is discussed quantitatively in the next section. 

V. THE LINDBLAD EQUATION 

For a complete treatment of the quantum system in its unknown environment, we write the Hamilto- 
nian as 

H = Ho + Hcnv + Hint, (38) 

with obvious notations. The last two terms in H generate a stochastic, rapidly fluctuating motion. 
The exact evolution, taking everything into account, is a Brownian motion (a kind of random walk) 
superimposed on the ideal motion. Assume that the unperturbed evolution due to Hq is very slow on the 
scale of idocoh, the time needed for Eq. (O) to be valid. One can thus write (combining the composite 
index fim into a single index j) 

A,^S,+F„ (39) 

where the matrices Sj correspond to the slow motion generated by Hq, and the matrices Fj to the fast 
fluctuations due to the environment. This neat decomposition into slow and fast variables involves of 
course some arbitrariness that will be reflected in the derivation of the Lindblad equation below. 

It follows from Eq. (39) that AjpA'- splits into three kinds of terms. Those quadratic in Sj represent 
the smooth evolution due to Hq. If we wish to write a differential equation for dp/dt, the other terms 
have to be smoothed out on a time scale much longer than tdccoh- Since this is a random walk, the terms 
linear in Fj average out to zero on that time scale, and the terms that are quadratic in Fj grow linearly 
in time. Clearly, this smoothing out and the resulting linear growth involve some approximations whose 
validity has to be ascertained on a case by case basis. 

Let us thus assume that there is a coarse time scale 5t ^ tdecoh, long enough so that the fluctuations 
are averaged out, and yet short enough so that the slow evolution due to Hq is negligible beyond first 
order in St. We can then write Fj ~ VjVSi, where the matrices Vj are finite. The result is the Lindblad 
equation [|3) 

dp/dt = z[p, Ho] + Y^iV.pVj - \pV^V, - \V]V,p). (40) 

i 

This equation is of course valid only in the future time direction {dt > 0), because the smoothing out 
of fluctuations entails an irreversible loss of information. Lindblad's original derivation used an abstract 
argument involving complete positivity and a semigroup structure (again dt > 0). An equivalent argument 
was given independently by Gorini et al. ]3q | . The present proof is based on an explicit dynamical model of 
interaction and it may be easier to understand. A similar derivation was also obtained by Schumacher | |39| 
and, after the present article was submitted for publication, a more detailed discussion, also based on 
the Kraus formalism, was published by Bacon et al. [Q. Still more recently, Adler sharpened the above 
heuristic derivation of Eq. (EO) by using Ito's stochastic calculus Wl[. 
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